ON THE LENGTH OF BINARY FORMS 

BRUCE REZNICK 

Q I Abstract. The K-length of a form / in K[xi, . . . , Xn], K C C, is the smallest 

O^ ' number of d-th powers of linear forms of which / is a X-linear combination. We 

present many results, old and new, about _R'-length, mainly in n = 2, and often 
about the length of the same form over different fields. For example, the X-length 
of 3a;^ - 20a;3y2 + iQxy* is three for K ^ Q(\/^), four for K = Q(\/^) and five 
for K = R. 
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Suppose /(xi, ...,Xn) is a form of degree d with coefficients in a field K C C. The 



2 ' K -length of f, Lxif), is the smallest r for which there is a representation 

r 
(1.1) f{xi,...,Xn) = ^\j{ajiXi H l-ajnXnY 



•^ . with A,-, a,fc £ -^- 

In this paper, we consider the i^-length of a fixed form f a.s K varies; this is 
i/S ' apparently an open question in the literature, even for binary forms (n = 2). Sylvester 

t^^ ■ [IHillQ] explained how to compute Lc{f) for binary forms in 1851. Except for a few 

O I remarks, we shall restrict our attention to binary forms. 

It is trivially true that Lxif) = 1 for linear / and for d = 2, Lxif) equals the 
rank of /: a representation over K can be found by completing the square, and this 
length cannot be shortened by enlarging the field. Accordingly, we shall also assume 
that d > 3. Many of our results are extremely low-hanging fruit which were either 
c^ I known in the 19th century, or would have been, had its mathematicians been able to 

take 21st century undergraduate mathematics courses. 

When K = C, the A/s in (II. ip are superfiuous. The computation of Lc{f) is a 
huge, venerable and active subject, and very hard when n > 3. The interested reader 
is directed to [I[5l[l0l[l2l[13l[17l[20l[23l[29l[39lll0llllj as representative recent 
works. Even for small n,d > 3, there are still many open questions. Landsberg and 
Teitler [29] complete a classification of Lc{f) for ternary cubics / and also discuss 
Lc{xiX2- ■ ■ Xn), among other topics. Historically, much attention has centered on 
the C-length of a general form of degree d. In 1995, Alexander and Hirschowitz 
PP (see also [21 ED) established that for n,d > 3, this length is \^{''^t^^)], the 
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constant-counting value, with the four exceptions known since the 19th century - 
{n,d) = (3,5), (4,3), (4,4), (4,5) - in which the length is [^(l^l')] + 1. 

An alternative definition would remove the coefficients from (II. ip . (The com- 
putation of the alternative definition is likely to be much harder than the one we 
consider here, if for no other reason than that cones are harder to work with than 
subspaces.) This alternative definition was considered by Ellison [H] in the special 
cases /C = C, R, Q. When d is odd and K = H, the A/s are also unnecessary. When 
d is even and /r C R, it is not easy to determine whether ( 11. ip is possible for a 
given /. In [42], the principal object of study is Qn,2k, the (closed convex) cone of 
forms which are a sum [Xj = 1) of 2A;-th powers of real linear forms. As two illus- 
trations of the difficulties which can arise in this case: a/2 is not totally positive in 
K = Q(v^), so ^/2 x^ is not a sum of squares in K[x\, and x^ + QXx'^y^ + y'^ G (52,4 
if and only if A G [0, 1]. For more on the possible signs that may arise in a mini- 
mal R- representation (II. ip . see |45J. Helmke [20J uses both definitions for length for 
forms, and is mainly concerned with the coefficient-free version in the case when K 
is an algebraically closed (or a real closed) field of characteristic zero, not necessarily 
a subset of C. Newman and Slater [51] do not restrict to homogeneous polynomials. 
They write x as a sum of d d-th powers of linear polynomials; by substitution, any 
polynomial is a sum of at most d d-th powers of polynomials. They also show that 
the minimum number of d-th. powers in this formulation is > yd. Because of the 
degrees of the summands, these methods do not homogenize to forms. Mordell [32] 
showed that a polynomial that is a sum of cubes of linear forms over Z is also a 
sum of at most eight such cubes. More generally, if i? is a commutative ring, then 
its d-Pythagoras number, Pd{R), is the smallest integer k so that any sum of d-th 
powers in i? is a sum of k d-th powers. This subject is closely related to Hilbert's 
17th Problem; see [HI El [Z] • 

Two examples illustrate the phenomenon of multiple lengths over different ffelds. 

Example 1.1. Suppose f{x,y) = {x + \/2yY + {x — \/2yY G Q[x,y]. Then Lj^if) is 
2 (if \/2 G K) and d (otherwise). This example first appeared in [121 P-137]. (See 
Theorem 4.6 for a generalization.) 

Exam,ple 1.2. If (f){x,y) = 3x^ — 20x^y'^ + lOxy^, then Lk{4>) = 3 if and only if 
^/^ G K, LKi(j)) = 4 for ir = Q(v/^), Q(y^), Q(v/^), Q(y^) (at least) and 
Ln{4') = 5. (We give proofs of these assertions in Examples 2.1 and 3.1.) 

The following simple definitions and remarks apply in the obvious way to forms 
in n > 3 variables, but for simplicity are given for binary forms. A representation 
such as (II. ip is called K-minim,alii r = Lx{f)- Two linear forms are called distinct 
if they (or their d-th powers) are not proportional. A representation is honest if the 
summands are pairwise distinct. Any minimal representation is honest. Two honest 
representations are different if the ordered sets of summands are not rearrangements 
of each other; we do not distinguish between t'' and {Cd^Y where C,d = e^'^'^l'^. 
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If g is obtained from / by an invertible linear change of variables over K^ then 
Lxif) = Lxig)- Given a form / G C[x,2/], the field generated by the coefficients of 
/ over C is denoted Ef. The iiT- length can only be defined for fields K satisfying 
Ef C K C C. The following implication is immediate: 

(1.2) K^CK2 =^ LKAf)>LKAf)- 

Strict inequality in (11.21) is possible, as shown by the two examples. The cabinet of 
/, C(/) is the set of all possible lengths for /. 

We now outline the remainder of the paper. 

In section two, we give a self-contained proof of Sylvester's 1851 Theorem (Theorem 
2.1). Although originally given over C, it adapts easily to any K (Z C (Corollary 
2.2). If / is a binary form, then Lx{f) < r iff a certain subspace of the binary forms 
of degree r (a subspace determined by /) contains a form that splits into distinct 
factors over K. We illustrate the algorithm by proving the assertions of lengths 3 
and 4 for in Example 1.2. 

In section three, we prove (Theorem 3.2) a homogenized version of Sylvester's 
1864 Theorem (Theorem 3.1), which implies that if real / has r linear factors over 
R (counting multiplicity), then i^R(/) > r. In particular, -Lr,(0) = 5. As far as we 
have been able to tell, Sylvester did not connect his two theorems: perhaps because 
he presented the second one for non-homogeneous polynomials in a single variable, 
perhaps because "fields" had not yet been invented. 

We apply these theorems and some other simple observations in sections four and 
five. We first show that if Lc(/) = 1, then Lg (/) = 1 as well (Theorem 4.1). Any set 
oi d + 1 d-th powers of pairwise distinct linear forms is linearly independent (Theorem 
4.2). It follows quickly that if f{x,y) has two different honest representations of 
length r and s, then r + s > d + 2 (Corollary 4.3), and so if LEf{f) = r < ^^, then 
the representation over Ef is the unique minimal C- representation (Corollary 4.4). 
We show that Example 1.1 gives the template for forms / satisfying Lc{f) = 2 < 
LeAJ) (see Theorem 4.6), and give two generalizations which provide other types 
of constructions (Corollaries 4.7 and 4.8) of forms with multiple lengths. We apply 
Sylvester's 1851 Theorem to give an easy proof of the known result that Lc{f) < d 
(Theorem 4.9) and a slightly trickier proof of the probably- known result that Lxif) < 
d as well (Theorem 4.10). Theorem 4.10 combines with Theorem 3.2 into Corollary 
4.11: if / € R[a;, I/] is a product of d linear factors, then Lji{f) = d. Conjecture 4.12 
asserts that / G R[x, y] is a product of d linear factors if and only if -^r(/) = d. 

In Corollary 5.1, we discuss the various possible cabinets when d = 3,4; and give 
examples for each one not already forbidden. We then completely classify binary 
cubics; the key point of Theorem 5.2 is that if the cubic / has no repeated factors, 
then Lk{f) = 2 if and only ii Ef{^y—3A{f)) C K; this significance of the discriminant 
A(/) can already be found e.g. in Salmon [ITJ §167]. This proves Conjecture 4.12 for 
d = 3. In Theorem 5.3, we show that Conjecture 4.12 also holds for d = 4. Another 
probably old theorem (Theorem 5.4) is that Lc{f) = d ii and only if there are 
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distinct linear forms £, £' so that / = i'^~^i'. The minimal representations of x'^y^ are 
parameterized (Theorem 5.5), and in Corollary 5.6, we show that Lk{{x^ + y'^)'') > 
k + 1, with equality if and only if tan -^^ G K. In particular, Lq((x^ + y^)^) = 4. 
Theorem 5.7 shows that Lq(x^ + GXx^y"^ + y^) = 3 if and only if a certain quartic 
diophantine equation over Z has a non-zero solution. 

Section six lists some open questions. 

We would like to express our appreciation to the organizers of the Higher Degree 
Forms conference in Gainesville in May 2009 for offering the opportunities to speak 
on these topics, and to write this article for its Proceedings. We also thank Mike 
Bennett, Joe Rotman and Zach Teitler for helpful conversations. 



2. Sylvester's 1851 Theorem 

Modern proofs of Theorem 2.1 can be found in the work of Kung and Rota: ^28, §5], 
with further discussion in f25\ [2S1 EZl SI] • We present here a very elementary proof 
showing the connection with constant coefficient linear recurrences, in the hopes that 
this remarkable theorem might become better known to the modern reader. 

Theorem 2.1 (Sylvester). Suppose 

(2-1) f(-^y) = t{'^ 



ttjx'^ ^y^ 



j=0 



and suppose 
(2.2) 






i=0 



is a product of pairwise distinct linear factors. Then there exist Ajt G C so that 



(2.3) 

if and only if 

(2.4) 



fi^^ y) = Y1 ^k^^kx + (3kyY 



fc=i 






02 






\ctd-r Cld-r+1 



ad J 



\cr) 





Vo/ 



that is, if and only if 

r 

(2.5) y^ ae+tCt = 0, 



0, 1, . . . , d — r. 



t=o 
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Proof. First suppose that (12 .Sp holds. Then for < j < d, 



ttj 



fc=l i=0 fc=l t=0 

r r r 



fc=l t=0 fc=l 



Now suppose that (12.41) holds and suppose first that c^ ^ 0. We may assume 
without loss of generality that c,. = 1 and that Uj = 1 in (12. 2p . so that the /3j's are 
distinct. Define the infinite sequence (oj), j > 0, by: 



r-l 



(2.6) Oj = aj if < j < r — 1; 0,,+^ = — 2, ^t+iCt for £ > 0. 

i=0 

This sequence satisfies the recurrence of (12.50 . so that 

(2.7) cij = Oj for j < d. 

Since |aj| < c ■ M-' for suitable c, M, the generating function 

oo 

i=o 
converges in a neighborhood of 0. We have 

^ r \ r— 1 / " \ oo / r \ 

^ c._,T* $(T) = 5^ 5^ c._(„_,)a, T" + ^ K^ c._,a._, T". 

,t=0 / n=0 \i=0 / n=r \t=0 J 

It follows from (12.60 that the second sum vanishes, and hence $(T) is a rational 
function with denominator 

r r 

J2cr-tr = h{T,l)=l[{l-(5,T). 
t=0 i=l 

By partial fractions, there exist A^ G C so that 

oo '' \ *" 

(2.8) Yl ~^^T' = *(^) = E i^ir ^ ^^- = E ^^/^^ 

A comparison of (12. 8p and (12. 7p with (12. ip shows that 

(2.9) fix, y) = Y {^^ ^3^''y' = E ^^ E f"^) ^^^'"'^' = E ^^(^ + ^^y^^ 

j=0 ^J^ k=l j=0 ^■' '' fc=l 

as claimed in (12. 3p . 
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If Cr = 0, then Cr_i 7^ 0, because h has distinct factors. We may proceed as before, 
replacing r by r — 1 and taking Cr-i = 1, so that fl2.2p becomes 



r-l 



r-1 



(2.10) 



t=0 j=l 

Since Cr = 0, (12. 4p loses a column and becomes 



X). 



( «0 

ax 



a\ 

0-2 



■ ■ ■ O'T^A 



lc,\ 



\ad-T fld-r+l ■ ■ ■ d'd-x) 

We argue as before, except that ( 12. 7p becomes 



C\ 



\Cr-l) 



/o\ 



Vo/ 



(2.11) 

and (12. 9p becomes 



for j < c? — 1, 



o-d — o,d + \ 



f{x,y) = Y^ 



(2.12) 



i=o 



ajx'^ ^y^ 



A./+i:^'i:(') 

fc=l j=0 ^"'^ 



/^^.x'^-^y^- 



Ar/ + J]Afc(x + /3fcl/)' 



k=l 



By ( I2.10p . (I2.12P meets the description of (12. 3p . completing the proof. 



D 



The (d-r + l) X (r + l) Hankel matrix in ([23!) will be denoted Hr{f). If (/, h) 
satisfy the criterion of this theorem, we shall say that h is a. Sylvester form for f. If 
the only Sylvester forms of degree r are Xh for A G C, we say that h is the unique 
Sylvester form for /. Any multiple of a Sylvester form that has no repeated factors 
is also a Sylvester form, since there is no requirement that A^ 7^ in (12. 3p . If / has 
a unique Sylvester form of degree r, then Lc{f) = r and Lxif) > r. 

The proof of Theorem 2.1 in [H] is based on apolarity. If / and h are given by 
(EH) and (122]), and /i(D) = 11^=1 (/^i^ - «j^)' then 



d-r 



h{D)f = J2 



d\ 




^d-r-ruym 



id — r — m)\m\ , 

m=0 ^ ^ \i=0 

Thus, ( 12. 4p is equivalent to h{D)f = 0. One can then argue that each linear factor 
in h{D) kills a different summand, and dimension counting takes care of the rest. 
In particular, if degh > d, then h{D)f = automatically, and this implies that 
Lcif) < c? + 1. Theorem 4.2 is a less mysterious explanation of this fact. 

If h has repeated factors, a condition of interest in [251 ISSl [271 [2H1 HI], then Gun- 
delfinger's Theorem |l8j, first proved in 1886, shows that a factor (/3x — ayY of h 
corresponds to a summand {ax + PyY^^~^q{x,y) in /, where q is an arbitrary form 
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of degree i — 1. (Such a summand is unhelpful in the current context when i > 2.) 
There is a lengthy history of the connections with the Apolarity Theorem in [H]. 

li d = 2s — 1 and r = s, then Hs{f) is s x (s + 1) and has a non-trivial null- vector; 
for a general /, the resulting form h has distinct factors, and so is a unique Sylvester 
form. (The coefficients of h, and its discriminant, are polynomials in the coefficients 
of /.) This is how Sylvester proved that a general binary form of degree 2s — 1 is 
a sum of s powers of linear forms and the minimal representation is unique. (If so, 
LkH) = s iS h splits in K, but this does not happen in general if s > 2.) 

li d = 2s and r = s, then Hs{f) is square; det(ifs(/)) is the catalecticant oi f. (For 
etymological exegeses on "catalecticant", see [l^l pp. 49-50] and [T?! pp. 104-105].) In 
general, there exists A so that the catalecticant of f{x,y) — Ax^* vanishes, and the 
resulting non-trivial null vector is generally a Sylvester form (no repeated factors). 
Thus, a general binary form of degree 2s is a sum of Ax^* plus s powers of linear 
forms. (It is less clear whether one should expect Lxif) = s -|- 1 for general /.) 

Sylvester's Theorem can be adapted to compute i^-length when K C C. 

Corollary 2.2. Given f G K[x,y], Lxif) is the minimal degree of a Sylvester form 
for f which splits completely over K . 

Proof. If (12.31) is a minimal representation for / over K, where Xk,ak,(^k ^ K, then 
h{x,y) G K[x,y] splits over K by (12. 2p . Conversely, if /z is a Sylvester form for / 
satisfying (12. 2p with a^, (3k G K, then (12. 3 p holds for some A^ G C. This is equivalent 
to saying that the linear system 



(2,13) 



E 

k=l 



VPix, 



(0 < J < d) 



has a solution {X^ = Xk} over C. Since aj,^;, "'/S;^ G K, it follows that (I2.13P also 
has a solution over K, so that / has a /^-representation of length r. D 

We apply these results to the quintic from Example 1.2. 
Example 2.1 (Continuing Example 1.2). Note that 



Since 



{x, y) = 3x^ - 20x'^y^ + lOxy 



+ 



l)-3x'+(J].0x^ 



3 0-2 0' 

0-202 

-2 2 0, 



fco\ 

Cl 



;-2) xY + (f) ■ X V + (f 1 ■ 2 xy" 



Oy' 



(co,Ci,C2,C3) = r(0, 1,0,1), 
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has a unique Sylvester form of degree 3: h{x,y) = yi^x^ + y"^) = y{y — ix){y + ix). 
Accordingly, there exist A^ G C so that 

0(x, y) = Xix^ + A2(x + lyf + X^ix - iyf. 

Indeed, Ai = A2 = A3 = 1, as may be checked. It follows that Lk{(P) = 3 if and only 
a i G K. (A representation of length two would be detected here if some A^ = 0.) 
To find representations for of length 4, we revisit (12. 4p : 

^4(0) ■ (co, ci, C2, C3, C4)* = (0, 0)* ^^ 3co - 2c2 + 2c4 = -2ci + 2c3 = 
^^ (co, ci, C2, C3, C4) = ri(2, 0, 3, 0, 0) + r^iO, 1, 0, 1, 0) + r^iO, 0, 1, 0, 1), 

hence h{x,y) = rix^(2a;^ + 3y'^) + yi^x"^ + y'^){r2X + r^y). Given a field f^, it is far 
from obvious whether there exist {r^} so that h splits into distinct factors over K. 
Here are some imaginary quadratic fields for which this happens. 

The choice (ri,r2,r3) = (1,0,2) gives h{x,y) = {2x'^ + ?/^)(x^ + 2?/^) and 

240(x, y) = 4(a; + y/^yf + 4(x - V^yf + (2a; + y/^yf + {2x - ^f^yf. 

Similarly, (ri,r2,r3) = (2,0,9) and (2,0,-5) give h{x,y) = (x^ + 3|/^)(4x^ + 3?/^) 
and (x^ — ?/^)(4x^ + 5?/^), leading to representations for of length 4 over Q{^/^) 
and Q(V— 5). The simplest such representation we have found for Q{y/^6) uses 
(ri,r2,r3) = (8450,0,-104544) and 

h{x, y) = {5x + 12y){5x - 12y){6 ■ U'^x'^ + 33^^). 

It is easy to believe that -^q(^/3^)(0) = 4 for all squarefree m > 2, though we have no 
proof. In Example 3.1, we shall show that there is no choice of (ri,r2,r3) for which 
h splits into distinct factors over any subfield of R. 

3. Sylvester's 1864 Theorem 

Theorem 3.1 was discovered by Sylvester [50j in 1864 while proving Isaac Newton's 
conjectural variation on Descartes' Rule of Signs, see [221 |5T]. This theorem appeared 
in Polya-Szego [371 Ch.5,Prob.79], and has been used by Polya and Schoenberg [36] 
and Karlin [211 P-466]. The (dehomogenized) version proved in [37] is: 

Theorem 3.1 (Sylvester). Suppose 7^ A^ for all k and 71 < ■ • • < 7r, ''" > 2, are 
real numbers such that 

r 

k=l 

does not vanish identically. Suppose the sequence (Ai, . . . , A^, (— l)'^Ai) has C changes 
of sign and Q has Z zeros, counting multiplicity. Then Z <C. 

We shall prove an equivalent version which exploits the homogeneity of / to avoid 
discussion of zeros at infinity in the proof. (The equivalence is discussed in 
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Theorem 3.2. Suppose f{x,y) is a non-zero real form of degree d with r real linear 
factors (counting multiplicity) and 

r 

(3.1) f{x,y) = ^A,(cos^,x + sm%)'^ 

i=i 
where —\<9i<---<6r<\,r>2 and Aj 7^ 0. Suppose there are a sign changes 
in the tuple (Ai, A2, . . . , A,., (— l)'^Ai). Then r < a. In particular, t < r. 

Example 3.1 (Examples 1.2 and 2.1 concluded). Since 

^{x,y) = 3x {x^ - i^y2) {x' - i^y^) 

is a product of five linear factors over R, Lr(0) > 5. The representation 

60(x, y) = 36x^ - 10(x + y)^ - 10(x - yf + {x + 2yf + (x - 2yf. 

over Q implies that C(0) = {3,4,5}. 

Proof of Theorem 3.2. We first "projectivize" (13. ip : 

r 

2/(x, y) = y2 Aj(cos 9jX + sin 9jyY+ 

(3.2) 

^(-l)''Arf(cos(^j + 7r)x + sin(^j + 7r)yY 
i=i 

View the sequence (Ai, A2, . . . , A^, (— l)'^Ai, (— 1)'^A2, . . . , (— l)'^Ar., Ai) cyclically, iden- 
tifying the first and last term. There are 2a pairs of consecutive terms with a negative 
product. It doesn't matter where one starts, so if we make any invertible change of 
variables (x, y) i-)- (cos6'x + sin 9y, — sin 6*0; + cos 9y) in (13. ip (which doesn't affect r, 
and which "dials" the angles by 9), and reorder the "main" angles to (— f , f ], the 
value of 0" is unchanged. We may therefore assume that neither x nor y divide /, that 
x"^ and y'^ are not summands in (13. 2p (i.e., 9j is not a multiple of |), and that if there 
is a sign change in (Ai, A2, . . . , A^), then 9u < < 9u+i implies A^Au+i < 0. Under 
these hypotheses, we may safely dehomogenize / by setting either x = 1 or y = 1 
and avoid zeros at infinity and know that r is the number of zeros of the resulting 
polynomial. The rest of the proof generally follows [37] . 

Let a denote the number of sign changes in (Ai, A2, . . . , A^). We induct on a. The 
base case is a = (and Xj > without loss of generality). If d is even, then a = 
and 

r 

fi^^ y) = Yl ^j(^^^ ^j^ + sin 9jyf 
is definite, so r = 0. If ci is odd, then a = 1. Let g{t) = f{t, 1), so that 



g'{t) = y^ d {Xj cos 9j) (cos 9jt + sin 9j' 



d-l 
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Since d — 1 is even, cos 6j > and Xj > 0, g' is definite and g' ^ 0. RoUe's Theorem 
implies that g has at most one zero; that is, r < 1 = a. 

Suppose the theorem is vahd for ci = m > and suppose that a = m + 1 in fl3.ip . 
Now let h{t) = f{l,t). We have 

r 

h'it) = ^d (Aj sin Oj) (cos 6^ + sin O^tY-^. 
i=i 
Note that h'{t) = q{l,t), where 

r 

g(x, y) = y^ d {Xj sin 9j) (cos 6jX + sin 9jyY~^. 
i=i 
Since a > 1, 6'm < < 6u+i implies that A„A„+i < 0, so that the number of sign 
changes in (rfAi sin6'i, (iA2sin^2, • • • , c^Ar-sin^r) is m, as the sign change at the w-th 
consecutive pair has been removed, and no other possible sign changes are introduced. 
The induction hypothesis implies that q{x,y) has at most m linear factors, hence 
^(1,^) = h'{t) has < m zeros (counting multiplicity) and Rolle's Theorem implies 
that h has < m + 1 zeros, completing the induction. D 

4. Applications to forms of general degree 

We begin with a familiar folklore result: the vector space of complex forms f in n 
variables of degree d is spanned by the set of linear forms taken to the d-th power. 
It follows from a 1903 theorem of Biermann (see [l2l Prop. 2. 11] or [l6j for a proof) 
that a canonical set of the "right" number of d-th powers over Z forms a basis: 

(4.1) {(iixi + ... + inXnY : < ifc G Z, i^ + ■ ■ ■ + i^ = d} . 

If / G K[xi, . . . , Xn], then / is a i^-linear combination of these forms and so Lx{f) < 
("^_^ ). We show below (Theorems 4.10, 5.4) that when n = 2, the bound for Li^{f) 
can be improved from d + 1 to d, but this is best possible. 

The first two simple results are presented explicitly for completeness. 

Theorem 4.1. /// G K[x,y], then Lxif) = 1 if and only if Lcif) = 1- 

Proof. One direction is immediate from (11.21) . For the other, suppose f{x,y) = 
{ax + (3yY with a,/3 G C. If a = 0, then f{x,y) = (3'^y'^, with (3^^ e K. If a 7^ 0, 
then /(x, y) = a^{x + {l3/a)yY- Since the coefficients of x'^ and dx'^~^y in / are a'^ 
and a'^~^/3, it follows that a'^ and (3 /a = {a'^~^ (3) / a^ are both in K. D 

Theorem 4.2. Any set {{ajX + PjyY : < j < rf} of pairwise distinct d-th powers 
is linearly independent and spans the binary forms of degree d. 

Proof. The matrix of this set with respect to the basis (Jx*^"*?/* is [q;^~*/3*], whose 
determinant is Vandermonde: 

aj Pj 



n 

0<j<k<d 
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This determinant is a product of non-zero terms by hypothesis. D 

By considering the difference of two representations of a given form, we obtain 
an immediate corollary about different representations of the same form. Trivial 
counterexamples, formed by splitting summands, occur in non-honest representations. 

Corollary 4.3. // / has two different honest representations : 

s t 

(4.2) f{x,y) = J2 ^^(«^a; + f3,yY = 5^/i,(7,x + S,yY, 

i=i j=i 

then s + t > d + 2. Ifs + t = d + 2in fl4.2p . then the combined set of linear forms, 
{aiX + f3iy, 'jjX + Sjy}, is pairwise distinct. 

The next result collects some consequences of Corollary 4.3. 

Corollary 4.4. Let E = Ej. 

(1) IfLEif) = r<l + l, then Ldf) = r, so C(/) = {r}. 

(2) //, further, LE^f) = r < f + |, then f has a unique C-minimal representation. 

(3) If d = 2s — 1 and Hs{f) has full rank and f has a unique Sylvester form h of 
degree s, then L^if) > s, with equality if and only if h splits in K . 

Proof. We take the parts in turn. 

(1) A different representation of / over C must have length > d+2 — r > | + 1 > r 
by Corollary 4.3, and so Lc{f) = r. 

(2) If r < I -|- |, then this representation has length > | + | > r, and so cannot 
be minimal. 

(3) lid = 2s — l and r = s, then the last case applies, so / has a unique C-minimal 
representation, and by Corollary 2.2, this representation can be expressed in 
K if and only if the Sylvester form splits over K. 

D 

We now give some more explicit constructions of forms with multiple lengths. We 
first need a lemma about cubics. 

Lemma 4.5. If f is a cubic given by (12.1 p and Il2{f) = I ) has rank < 1, 

\ai 02 a^J 

then f is a cube. 

Proof. If gq = 0, then ai = 0, so 02 = and / is a cube. If Oq 7^ 0, then 02 = a^/oo 
and 03 = aia2/ao = af/al and /(x, y) = ao(a; + —yY is again a cube. D 

Theorem 4.6. Suppose d>3 and there exist ai,Pi E C so that 

(4.3) fix,y) = J2 ('^)a,x^-V = (aix + P,yY + icy2X + M'' E K[x,y]. 

If (14. 3 p is honest and LK^f) > 2, then there exists u E K with -^/u ^ K so that 
L^(^)(/) = 2. The summands in (14.31) are conjugates of each other in K{^/u). 
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Proof. First observe that if 0:2 = 0, then a2/3i 7^ «i/32 iniphes that ai 7^ 0. But then 
ao = af ^ and ai = af~^/3i imply that af , (3i/ai G -ft' as in Theorem 4.1, and so 

fix,y) - atix + {Ma{)yY = {^^yf = Ph' e K[x,y]. 

This contradicts L^if) > 2, so 0^2 7^ 0; similarly, ai 7^ 0. Let Aj = a^ and 7j = /3j/aj 
for i = 1,2, so A1A2 7^ and 71 7^ 72. We have 

f{x, y) = Ai(x + 7ii/)'^ + A2(x + 72?/)'^ ^ a^ = Ai7i + ^272- 

Now let 

g{x, y) = Ai(x + 71?/)^ + A2(x + 72?/)^ = oox^ + Saix'^y + 3a2xy'^ + a^y^. 

Since Aj 7^ and (14.31) is honest. Corollary 3.5 implies that Lc{g) = 2, so H2{g) has 
full rank by Lemma 4.5. It can be checked directly that 

and this gives h{x, y) = [y — lix){y — 72a;) as the unique Sylvester form for g. Since 
H2{g) has entries in K and hence has a null vector in K, we must have h G K[x, y]. By 
hypothesis, h does not split over K; it must do so over K{y^), where u = (71 — 72)^ = 
(71 + 72)^ — 47x72 G K. Moreover, if a denotes conjugation with respect to y/u, then 
72 = cr(7i) and since Ai + A2 G K, A2 = cr(Ai) as well. Note that Aj = af and 
7i = /3j/tti £ -ft'lv^)' but this is not necessarily true for aj and /3j themselves. D 

Corollary 4.7. Suppose g G i?[x,|/] does not sj)/zt over E, but factors into distinct 
linear factors g{x,y) = YYj=i{x + ajy) over an extension field K of E. If d > 2r — 1, 
then for each i > 0, 

r 

Mx,y) = ^a^jix + ajyY e E[x,y], 
i=i 

andLxifi) = r <d + 2-r < LE{fi). 

Proof. The coefficient of [f^x^'^^y^ in fg is J2^j=i ^^j^^ ■ Each such power-sum belongs 
to E by Newton's Theorem on Symmetric Forms, li as ^ E (which must hold for at 
least one a^ 7^ 0), then af(x + asyY ^ E[x,y]. Apply Corollary 4.3. D 

Corollary 4.8. Suppose K is an extension field of Ef, r < ^^, and 

r 

/(x,i/) = ^Aj(aiX + /3jy)'^ 

i=l 

with Xi,ai,f3i G K. Then every automorphism of K which fixes Ef permutes the 
summands of the representation of f . 
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Proof. We interpret a{X{ax + l3yY) = a{X){a{a)x + a{(5)yY. Since a{f) = /, the 
action of a is to give another representation of /. Corollary 4.4(2) implies that this 
is the same representation, perhaps reordered. D 

This next theorem is undoubtedly ancient, but we cannot find a suitable reference. 

Theorem 4.9. If f E K[x,y\, then Lc{f) < degd. 

Proof. By a change of variables, which does not affect the length, we may assume 
that neither x nor y divide /, hence aoUd 7^ and h = adx'^ — a^y'^ is a Sylvester form 
which splits over C. D 

Theorem 4.9 appears as an exercise in Harris |T9l Ex. 11. 35], with the (dehomoge- 
nized) maximal length occurring at x'^~^{x + 1) (see Theorem 5.4). Landsberg and 
Teitler [29| Cor. 5.2] prove that Lc{f) < ("^_7 ) ~i^^^)^ which reduces to Theorem 
4.9 for n = 2. 

The proof given for Theorem 4.9 will not apply to all fields K, because aax'^ — a^y'^ 
usually does not split over K. A more careful argument is required. 

Theorem 4.10. If f e K[x,y], then LkU) < degd. 

Proof. Write / as in 02. ip . If / is identically zero, there is nothing to prove. Other- 
wise, we may assume that /(1,0) = Oq 7^ after a change of variables if necessary. 
By Corollary 2.2, it suffices to find /i(x, y) = J2k=o CkX'^^^y'^ which splits into distinct 
linear factors over K and satisfies 'Ylik=(d^kCk = 0. 

Let Co = 1 and efc(ti, . . . ,td_i) denote the usual fc-th elementary symmetric func- 
tions. We make a number of definitions: 

d-l d-l 

hoiti, ..., td-1] X, y) := ^ efc(ti, . . . , td-i)x'^~^~^y'' = JJ(x + tjy), 

fc=o i=i 

d-l 

/3(^i, • • • , td-i) '■= — 2_^ ctfcefc(ti, . . . ,td- 

k=0 
d-l 

Oi{ti, . . . , td-i) := > afc_|_iefc(ti, . . . ,td- 



-1), 



fe=0 



d-l 



$(ti, . . . , td-i) := n("(^i' • • • ' ^rf-i)^i " /^(^i' • • • ' ^'i-i))' 
^{tu...,td-i):=Htu...,td-i) n (ti-tj). 

l<i<j<d-l 

Then /3(0, ...,0) = -aoCo = -cto ¥" 0, so <l>(0,...,0) = a^'^ ^ and $ is not 
the zero polynomial, and thus neither is \E'. Choose 7^ G -ft', 1 < j < (i — 1, so 
that ^1^(71, . . . ,7d-i) 7^ 0. It follows that the 7j's are distinct, and 0:7^ 7^ /3, where 
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a = a(7i, . . . , 7d_i) and /3 = /3{ji, ..., 7d_i). Let e^ = 6^(71, . . . , 'Jd-i)- We claim 
that 

d d-l 

H^^ y) = ^ Cix'^'^y' := (ax + /3|/)/io(7i, • • • , 7rf-i; x, v) = ("a; + l3y) JJ(x + 7^?/) 

i=0 j=l 

d-l d-l 

= (ax + (3y) Y^ eux'^-^-^y^ = aeox'^ + ^(ae^ + (3ek-i)x'^-%'' + /Je^.i/ 

fc=0 fc=l 

is a Sylvester form for /. Note that the 7j's are distinct and a'^j j^f3,l<j<d — 1, 
so that h is a product of distinct linear factors. Finally, 

d d-1 

y^ flfcCfc = aeotto + ^(acfc + (3ek-i)ak + (3ed-iak = 

k=0 k=l 

d-1 d-1 

a 

fc=0 k=0 

This completes the proof. D 



: ^ efcflfc + /3 X] ^kO-k+i = tt(-/3) + /3a = 0. 



Corollary 4.11. If f is a product of d real linear forms, then -^r(/) = d. 

Proof. Write / as a sum of -^r(/) = r < d d-th powers and rescale into the shape 
(13.11) . Taking r = d in Theorem 3.2, we see that d < a < r. D 

Conjecture 4.12. If f E I{.[x,y] is a form of degree d > 3, then -^r(/) = d if and 
only if f is a product of d linear forms. 

We shall see in Theorems 5.2 and 5.3 that this conjecture is true for d = 3,4. 

5. Applications to forms of particular degree 

Corollary 4.3 and Theorem 4.10 impose some immediate restrictions on the possible 
cabinets of a form of degree d. 

Corollary 5.1. Suppose deg/ = d. 

(1) IfLcif) = r, then C{f) C{r,d-{r- 2),d-{r - 3), . . .,d}. 

(2) IfLcif) = 2, then C{f) is either {2} or {2,d}. 

(3) /// has k different lengths, then d >2k — 1. 

(4) /// is cubic, thenC{f) = {1}, {2}, {3} or {2,3}. 

(5) /// IS quartic, then C{f) = {1}, {2}, {3}, {4}, {2,4} or {3,4}. 

We now completely classify Lxif) when / is a binary cubic. 

Theorem 5.2. Suppose f{x,y) G Ef[x,y] is a cubic form with discriminant A and 
suppose Ef C K C C. 

(1) If f is a cube, then LE^{f) = 1 and C{f) = {1}. 
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(2) // / has a repeated linear factor, hut is not a cube, then Lxif) = 3 and 

C{f) = {3}. 

(3) // / does not have a repeated factor, then Lk{p) = 2 if ^/— 3A G K and 
Lk{p) = 3 otherwise, so either C{f) = {2} or C{f) = {2,3}. 

Proof. The first case follows from Theorem 4.1. In the second case, after an invertible 
linear change of variables, we may assume that f{x,y) = Sx'^y, and apply Theorem 
2.1 to test for representations of length 2. But 

(5.1) (? J o) ■ [llj = (2) ^ Co = ci = 0, 

so h has repeated factors. Hence Lk^x^v) > 3 and by Theorem 4.10, Lxix^y) = 3. 
Finally, suppose 

3 

/(x, y) = aox^ + Saix'^y + 3a2xy'^ + a^y^ = Y\_i''^j^ + ^jV) 

i=i 
does not have repeated factors, so that 

0^A(/) = J](r,Sfc-r,s,)2, 
j<k 

and consider the system: 

ao ai a2\ I ^\ ^ /O 
ai 02 as) I M \0 

By Lemma 4.5, this system has rank 2; the unique Sylvester form is 

h{x, y) = (aifla — al)x'^ + (aia2 — o,o<^3)xy + (00^2 — (^\)y^-, 

which happens to be the Hessian of /. Since h G Ef[x,y] C K[x,y], it splits over K 
if and only if its discriminant is a square in K. A computation shows that 

^2 ,.„ „ 2^,„ „ ., A(/) 3A(/) 



(aia2 ~ OoQ-s) ~ 4(aia3 — 02) (00*^2 ^ Oi) 



27 92 

Thus, Lj^if) = 2 if and only if a/— 3A(/) G iT. If /i does not split over F, then 
Lj,(/) = 3 by Theorem 4.10. D 

In particular, x^, x^ + y^, x^y and {x + iyY+{x — iy)^ have the cabinets enumerated 
in Corollary 5.1(4). If / has three distinct real linear factors, then A(/) > 0, so 
Y^— 3A(/) ^ R and -^r(/) = 3. If / is real and has one real and two conjugate 
complex linear factors, then A(/) < 0, so i^R(/) = 2. Counting repeated roots, we 
see that if / is a real cubic, and not a cube, then L-^{f) = 3 if and only if it has three 
real factors, thus proving Conjecture 4.12 when d = ?>. 
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Example 5.1. We find all representations of Sx'^y of length 3. Note that 

^3(/) ■ (co,Ci,C2,C3)* = (0) ^^ ci = ^=^ h{x,y) = cox-^ + C2xy'^ + csy^. 

If Co = 0, then y"^ \ h, which is to be avoided, so we scale and assume Cq = 1. We 
can parameterize the Sylvester forms h{x, y) = {x — ay){x — by){x + (a + b)y) with 
a, 6, — (a + b) distinct. This leads to an easily checked general formula 

3(a - 6)(a + 26) (2a + b)x'^y = 

(a + 2b){ax + yf - (2a + b){bx + yf + (a - 6)(-(a + b)x + y)l 

It is not hard to find analogues of (15.21) for (i > 3; we leave this to the reader. 



Theorem 5.3. If f is a real quartic form, then L^if) = 4: if and only if f is a 
product of four linear factors. 

Proof. Factor ±/ as a product of k positive definite quadratic forms and 4 — 2k linear 
forms. If fc = 0, then Corollary 4.11 implies that L-£i{f) = 4. We must show that if 
k = 1 or k = 2, then / has a representation over R as a sum of < 3 fourth powers. 

If A; = 2, then / is positive definite and by [381 Thm.6], after an invertible linear 
change of variables, f{x,y) = x^ + GXx'^y'^ + y^, with 6A G (—2,2]. (This is also 
proved in [l6].) If r 7^ 1, then 

{rx + yY + (x + ryY — {r^ + r)(x + yY 
^^■^^ = (r - l)\r' + r + l){x'- (,^) x^ + y') . 

Let 0(r) = — p2_['^^_i - Then 0(— 2 + -\/3) = 2 and 0(1) = —2, and since is continuous, 

it maps [—2 + -\/3, 1) onto (—2, 2], and (15. 3p shows that -^r(/) < 3. 

If A; = 1, there are two cases, depending on whether the linear factors are distinct. 
Suppose that after a linear change, f{x,y) = x'^h{x,y), where h is positive definite, 
and so for some A > and linear i, h{x,y) = Xx"^ + £^. After another linear change, 

(5.4) fix, y) = x\2x^ + I2y^) = [x + yf + (x - yf - 2y\ 

and (15.41) shows that L-[i{f) < 3. 

If the linear factors are distinct, then after a linear change, 

f{x, y) = xyi^ax^ + 2bxy + cy"^) 

where a > 0, c > 0, 6^ < ac. After a scaling, /(x, y) = xy{x'^ + dxy + y'^), \d\ < 2, and 
by taking ±/(x, ±?/), we may assume c? G [0, 2). If r 7^ 1, then 

{r^ + l)(x + yf — {rx + yf — (x + ryf 
^^'^^ = 4(r - Ifir' + r + 1) (x^ + {^^) xW + xy') . 

Let ^p{r) = 2(r'2+r+i) • Since V'(~l) = 0, V'(l) = 2 and ^p is continuous, it maps [—1, 1) 
onto [0, 2), and ([ESD shows that Lr(/) < 3. D 
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The next result must be ancient; Lc{x ^y) = d seems well known, but we have 



not found a suitable reference for the converse. Landsberg and Teitler 
show that Lc{x"'y^) = max(a + 1, 6 + 1) if a, 6 > 1. 



Cor.4.5] 



Theorem 5.4. If d > 3, then Lc{f) 
forms i and f so that f = i'^-H' . 



d if and only if there are two distinct linear 



Proof. If / = i'^~^i', then after an invertible linear change, we may assume that 
f{x,y) = dx'^'^y. If Lc{dx'^~^y) < d — 1, then / would have a Sylvester form of 
degree d — 1. But then, as in (15.11) . (12.41) becomes 



/Co \ 

Cl 



Co = Cl = 0, 



d. 



so h does not have distinct factors. Thus, Lc{dx'^~^y) 

Conversely, suppose Lc{f) = d. Factor / = Yl^'P ^s a product of pairwise 
distinct linear forms, with Y^rrij = d, mi > m2 ■ ■ ■ > mg > 1, and s > 1 (otherwise, 
Lcif) = 1-) Make an invertible linear change taking (^1,^2) '~^ {x,y), and call the 
new form g; Lc{g) = d as well. If g{x,y) = X]£=o {ij^i^'^^^y^^ then bo = bd = 0. By 
hypothesis, there does not exist a Sylvester form of degree d — 1 for g. Consider 



If nil ^ ^TT'2 ^ 2, then x 






bi 




bd- 


-2 bd-i 


bi 


b2 




bd- 


1 


x' 


.y' 


g{x 


y) 


and 61 = 



/Co \ 

Cl 



\Cd-lJ 



M-1 



and x'^ ^ — y'^ ^ is a Sylvester 



form of degree d — 1 for /. Thus m2 = 1 and so y"^ does not divide g and 61 7^ 0. Let 
q{t) = J2iZo ^«+i^* (note the absence of binomial coefficients!) and suppose g(to) = 0. 
Since g(0) = 61, to 7^ 0. We have 



bi 
bi &2 



bd~2 
bd-i 



bd-i 




to 



toq{to] 
qito) 



Since 



d-i 



Hx,y) = Y^ 



IqX 



d-l- 



Y 



ty 



d-l 



x-toy 



Hix - Cl.toy) 



i=0 ""^ fc=l 

has distinct linear factors, it is a Sylvester form for g, and Lc{g) < d — 1. This con- 
tradiction implies that q has no zeros, and by the Fundamental Theorem of Algebra, 
g(t) = 61 must be a constant. It follows that g{x,y) = dbix'^~^y, as promised. D 



18 BRUCE REZNICK 



By Corollaries 4.4 and 5.1, instances of the first five cabinets in Corollary 5.1(5) 
are: x^, x^ + y^, x^ + y^ + {x + y)^, x^y and (x + iy)^ + (x — iy)^. It will follow from 
the next resuhs that C{{x^ + y'^f) = {3,4}. 

Theorem 5.5. If d = 2k and f{x,y) = [\)x^y^, then Lc{f) = k + 1. The minimal 
C -representations of f are given by 

(5.6) {k + 1) (^^)xV = i2^Ck+2^x + Qi+2^''y?\ ^ «; G C. 

Proof. We first evaluate the right-hand side of (15. 6 p by expanding the 2A;-th power: 

k k 2k / \ 



(5.7) 



j=0 i=0 i=0 

2k ^ , s 

'^'^ \ 2k~2t2k-t„.t 



. t 
t=0 



E I . ]w-^ '^^"^ ^y 




But ^"Lo Cm = unless ?7i | r, in which case it equals m. Since the only multiple of 
k + 1 va. the set {fc — t : < t < 2k} occurs for t = k, (I5.7p reduces to the left-hand 
side of (15.61) . We now show that these are all the minimal C-representations of /. 

Since H^i^x^y'^) has I's on the NE-SW diagonal, it is non-singular, so Lc{x^y^) > k, 
and Lc{x^y^) = A; + 1 by (15.61) . By Corollary 4.3, any minimal C-representation not 
given by (15. 6p can only use powers of forms which are distinct from any wx + w~^y. 
If ah = c^ ^ 0, then ax + hy is a, multiple of -x + -y. This leaves only x'^^ and y"^^, 
and there is no linear combination of these giving x^y^. D 

The representations in (15. 6p arise because the null- vectors of Hk+i{x^y^) can only 
be (co, 0, . . . , 0, Cfe+i)* and coa;'^+^ + cj.+iy^'^^ is a Sylvester form when coCfe+i ^ 0. 

Corollary 5.6. For k>l, Lc{{x^ + y'^f) = k + l, and Lk{{x^ + y'^f) = k + I iff 
tan j^ G K . The C-minimal representations of (x^ + y^)'^ are given by 

(5.8) Q(x^ + yy = ^EM^ + ^)^ + ^i^(^ + %)'' ^^C. 

Proof The invertible map {x,y) H- {x — iy,x + iy) takes x^y'' into (x^ + ?/^)'^. Setting 
7^ w = e*^ in (15.61) gives (15.81) after the usual reduction. If tana ^ 0, then 

(cos a X + sin a y^^' = cos^*" a ■ (x + tana yY = (1 + tan^ a)~^'(x + tana yY- 

Thus, (cos a x + sin a yY^ G K[x,y] iff cos a = or tana G iC. It follows that 
Lxiix"^ + y^)'^) = fc + 1 if and only if there exists G C so that for each < j < 
/c, either cos(^^ + 6^) = or tan(^^ + 6*) G K. Since tan a, tan /3 G K imply 
tan(a — (3) ^ K and fc > 1, we see that (15. 8p is a representation over if if and only 
if tan j^eK. D 
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In particular, since tan | = a/3 ^ Q, Lq((x^ + ?/^)^ > 3 and so must equal 4. Thus, 
C((x^ + y'^Y) = {3, 4}, as promised. Since tan ^ is irrational for m > 5 (see e.g. [351 
Cor.3.12]), it follows that Lq{{x^ + y'^f) = k + l only for A; = 1, 3. 

It is worth remarking that x^y^ is a highly singular complex form, as is (x^ + y'^Y- 
However, as a rea/ form, {x'^ + y'^Y is in some sense at the center of the cone Q2,2k- For 
real 6, the formula in (15.81) goes back at least to Friedman [16] in 1957. It was shown 
in [12] that all minimal real representations of (x^ + |/^)^ have this shape. There is an 
equivalence between representations of {x"^ + y"^)^ as a real sum of 2/c-th powers and 
quadrature formulas on the circle - see [12]. In this sense, (15. 8p can be traced back 
to Mehler [30] in 1864. Taking fc = 7, ^ = and p := tan f = ^2 - 1 in (jEHD gives 



+ 



i|(a:^ + yy = x^' + y'' + ^, ((x + yf + {x - yf) 
'-^)\{x + pyf + {x - pyf' + {px + yf^ + {px ~ yf) 



A real representation (II. ip of i^xj)^ (with positive real coefficients \j) is called a 
Hilbert Identity; Hilbert [2T1 HB] used such representations with rational coefficients 
to solve Waring's problem. Hilbert Identities are deeply involved with quadrature 
problems on S"""^, the Delsarte-Goethals-Seidel theory of spherical designs in com- 
binatorics and for embedding questions in Banach spaces [^ Ch.8,9], as well as for 
explicit computations in Hilbert's 17th problem [13]. It can be shown that any such 
representation requires at least ("^__7 ) summands, and this bound also applies if 
negative coefficients \j are allowed. It is not known whether allowing negative co- 
efficients can reduce to the total number of summands. When i^xfY is a sum of 
exactly ("^_^ ) 2k-ih. powers, the coordinates of minimal representations can be used 
to produce tight spherical designs. Such representations exist when n = 2, 2k = 2, 
(n,2fc) = (3,4), {n,2k) = {u^ - 2,4) {u = 3,5), {n,2k) = {3v^ - 4,6) {v = 2,3), 
(n, 2k) = (24, 10). It has been proved that they do not exist otherwise, unless possi- 
bly (n, 2k) = (m^ — 2, 4) for some odd integer m > 7 or (n, 2k) = (3f ^ — 4, 6) for some 
integer v > A. These questions have been largely open for thirty years. It is also not 
known whether there exist {k,n) so that L-[i{{^ xf)^)) > Lc{{J2 xf)'") , although this 
cannot happen for n = 2. For that matter, it is not known whether there exists any 
/ e Qn,d so that Lr(/) > Lcif). 

We conclude this section with a related question: if fx{x,y) = x'^ + GXx^y'^ + y"^ 
for A e Q, what is Lq(/a)? If A < — |, then fx has four real factors, so -^q(/a) = 4. 
Since det H2ifx) = A - A^ Lcifx) = 2 for A = 0, 1, -1. The formula 



X 



4 + QXxY + y') = l {{x + y)' + (x - y)') + (1 - A)(x^ + y^ 



shows that -Lq(/o) = -^q(/i) = 2; 2f_i{x,y) = (x + iy)'^ + (x - iy)^ has Q-length 4. 

Theorem 5.7. Suppose A = f G Q, A^ ^ A. Then Lq(x^ + 6\x^y^ + y^) = 3 if and 
only if there exist integers (m, n) ^ (0, 0) so that 

(5.9) r(a, 6, m, n) = 4a^6 m^ + (6^ - 6a^6^ - 3a^)m'^n'^ + ia^b n^ 
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IS a non-zero square. 



Proof. By Corollary 2.2, such a representation occurs if and only if there is a cubic 
h{x, y) = ^j=o CiX^~^y'' which splits over Q and satisfies 

(5.10) C0 + AC2 = ACi+Cg = 0. 

Assume that h{x, y) = {mx + ny)g{x, y), (m, n) ^ (0, 0) with m, n G Z. If g{x, y) = 
rx'^ + sxy + ty'^, then cq = mr, c\ = ms + nr, c^ = mt + ns, C3 = nt and (15.101) becomes 

/ \ I r\ / 

/^ -,-,\ I rn Xn Xm\ I \ /O 

(5.11) 



An Am n J \ j \0 

If 771 = 0, then the general solution to (15.111) is (r, s,t) = (r, 0, — Ar) and rx^ — Xry'^ 
splits over Q into distinct factors iff A is a non-zero square; that is, iff ab is a square, 
and similarly if n = 0. Otherwise, the system has full rank since A^ 7^ 1 and any 
solution is a multiple of 

(5.12) rx'^ + sxy + ty'^ = {Xn^ — X^m'^)x^ + (A^ — l)mnxy + {Xrn^ — X^n'^)y'^. 
The quadratic in (15.121) splits over Q into distinct factors iff its discriminant 

(5.13) 4A^m^ + (1 - 6A^ - ^X'^)m^n^ + 4A^n^ = r^r(a, 6, m, n) 

is a non-zero square in Q. D 

In particular, we have the following identities: r('u^,f^,f ,«) = {u^v — uv^Y ^^^ 
V{uv,v? — uv + f ^, 1, 1) = {u — v)^{u + vY, hence Lq(/a) = 3 for A = r^ and 
A = ^2^^, I , where r = ^ G Q, r 7^ ±1. These show that Lq(/a) = 3 for a dense set 
of rationals in [— ^ 00). These families do not exhaust the possibilities. If A = y, so 
fx{x, y) = x^ + 76x^2/^ + y^, then A is expressible neither as r^ nor ^gj^^^^^ for r G Q, 
but r(38, 3, 2, 19) = 276906^. 

We mention two negative cases: if A = |, r(l,3,m, n) = 12(m^ + n^)^, which is 
never a square, giving another proof that Lq((x^ + y'^Y) =4. If A = |, then 

r(l, 2, m, n) = 8m^ - llmV + 8n^ ='^{rr? - n^f + f (m^ + n^f, 

hence if Lq{x^ + 'ix^y'^ + y'^) = 3, then there is a solution to the Diophantine equation 
27X^ -|- bY"^ = Z^. A simple descent shows that this has no non-zero solutions: 
working mod 5, we see that 2X^ = Z^; since 2 is not a quadratic residue mod 5, it 
follows that 5 \ X, Z, and these imply that 5 | F as well. 

Solutions of the Diophantine equation Am'^ + Bm^ri^ + Cn^ = r^ were first studied 
by Euler; see [H] [pp. 634-639] and [33] [pp. 16-29] for more on this topic. This equation 
has not yet been completely solved; see [H |9]. We hope to return to the analysis of 
(15.91) in a future publication. 
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6. Open Questions 

Conecture 4.12 seems plausible, but as the degree increases, the canonical forms 
become increasingly involved. Are there other fields besides C (and possibly R) for 
which there is a simple description of {/ : Lxif) = deg/}? 

Which cabinets are possible? Are there other restrictions beyond Corollary 5.1(1)? 
How many different lengths are possible? If |C(/)| > 4, then d >7. 

Can / have more than one, but a finite number, of i^-minimal representations, 
where K is not necessarily equal to Efl Theorem 5.7 might be a way to find such 
examples. 

Length is generic over C, but not over R. For d = 2r, the R-length of a real form 
is always 2r in a small neighborhood of n-;=i(^ ~ Jy)y but the R-length is always 
r + 1 in a small neighborhood of [x"^ + y'^Y, by [12] • Which combinations of degrees 
and lengths have interior? Does the parity of d matter? 



[1] 
[2] 
[3] 
[4] 
[5] 
[6] 
[7] 
[8] 

[9] 
[10] 

[11] 
[12] 

[13] 
[14] 



References 

J. Alexander and A. Hirschowitz, Polynomial interpolation in several variables, J. Algebraic 

Geom., 4 (1995), 201-222, MR1311347 (96f:14065). 

M. Brambilla and G. Ottaviani, On the Alexander- Hirschowitz theorem, J. Pure Appl. Algebra, 

212 (2008), 1229-1251, arxiv-m ath/070 1409, MR2387598 (2008m:14104). 

E. Brown, x^ + dx^y^ + 2/^ = z'^ ■' some cases with only trivial solutions — and a solution Euler 

missed, Glasgow Math. J., 31 (1989), 297-307, MR1021805 (91d:11026). 

E. Carlini, Varieties of simultaneous sums of power for binary forms, Matematiche (Catania), 



57 (2002), 83-97, arxiv-niath.AG/0202050| MR2075735 (2005d:11058). 



E. Carlini and J. Chipalkatti, On Waring's problem for several algebraic forms. Comment. 
Math. Hclv., 78 (2003), 494-517, arxiv-math .AG/0112110, MR1998391 (2005b:14097). 
M. D. Choi, Z. D. Dai, T. Y. Lam and B. Reznick, The Pythagoras number of some affine 
algebras and local algebras, J. Reine Angew. Math., 336 (1982), 45-82, MR0671321 (84f:12012). 
M. D. Choi, T. Y. Lam, A. Prestel and B. Reznick, Sums of 2mth powers of rational functions 
in one variable over real closed fields. Math. Z., 221 (1996), 93-112, MR1369464 (96k:12003). 
M. D. Choi, T. Y. Lam and B. Reznick, Sums of squares of real polynomials, _?C-theory and 
algebraic geometry: connections with quadratic forms and division algebras (Santa Barbara, 
CA, 1992), 103-126, Proc. Sympos. Pure Math., 58, Part 2, Amer. Math. Soc, Providence, 
RI, 1995, MR1327293 (96f:11058). 

J. H. E. Cohn, On the Diophantine equation z^ = x'^ -\- Dx^y^ + y'^, Glasgow Math. J., 36 
(1994), 283-285, MR1295501 (95k:11035). 

P. Comon and B. Mourrain, Decomposition of quantics in sums of powers of linear forms. Signal 
Processing, 53 (1996), 93-107. 

L. E. Dickson, History of the Theory of Numbers, vol U: Diophantine Analysis, Carnegie Insti- 
tute, Washington 1920, reprinted by Chelsea, New York, 1971, MR0245500 (39 #6807b). 
1. Dolgachev and V. Kanev, Polar covariants of plane cubics and quartics. Adv. Math., 98 
(1993), 216-301, MR1213725 (94g:14029). 

R. Ehrenborg and G.-C. Rota, Apolarity and canonical forms for homogeneous polynomials, 
European J. Combin., 14 (1993), 157-181, MR1215329 (94e:15062). 

W. J. Ellison, A 'Waring's problem' for homogeneous forms, Proc. Cambridge Philos. Soc, 65 
(1969), 663-672, MR0237450 (38 #5732). 



22 BRUCE REZNICK 

[15] W. J. Ellison, Warmg's problem, Amer. Math. Monthly, 78 (1971), 10-36, MR0414510 (54 

#2611). 
[16] A. Friedman, Mean-values and polyharmonic polynomials, Michigan Math. J., 4 (1957), 67-74, 

MR0084045 (18,799b). 
[17] A. Geramita, Inverse systems of fat points: Waring's problem, secant varieties of Veronese 

varieties and parameter spaces for Gorenstein ideals, The Curves Seminar at Queen's, Vol. X 

(Kingston, ON, 1995), 2-114, Queen's Papers in Pure and Appl. Math., 102, Queen's Univ., 

Kingston, ON, 1996, MR1381732 (97h:13012). 
[18] S. Gundelfinger, Zur Theorie der bindren Formen, J. Reine Angew. Math., 100 (1886), 413-424. 
[19] J. Harris, Algebraic geometry. A first course. Graduate Texts in Mathematics, 133, Springer- 

Verlag, New York, 1992, MR1182558 (93j:14001). 
[20] U. Helmke, Waring's problem for binary forms, J. Pure Appl. Algebra, 80 (1992), 29-45, 

MR1167385 (93e:11057). 
[21] D. Hilbert, Beweis fur die Darstellbarkeit der ganzen Zahlen durch eine feste Anzahl n-ter 

Potenzen (Waringsches Problem), Math. Ann., 67 (1909), 281-300, Ges. Abh. 1, 510-527, 

Springer, Berlin, 1932, reprinted by Chelsea, New York, 1981. 
[22] P. Holgate, Studies in the history of probability and statistics. XLI. Waring and Sylvester on 

random algebraic equations, Biometrika, 73 (1986), 228-231, MR0836453 (87m:01026). 
[23] A. larrobino and V. Kanev, Power Sums, Gorenstein algebras, and determinantal loci, Lecture 

Notes in Mathematics, 1721 (1999), MR1735271 (2001d:14056). 
[24] S. Karlin, Total Positivity, vol. 1, Stanford University Press, Stanford, 1968, MR0230102 (37 

#5667). 
[25] J. P. S. Kung, Gundelfinger's theorem on binary forms. Stud. Appl. Math., 75 (1986), 163-169, 

MRG859177 (87m:11020). 
[26] J. P. S. Kung, Ganonical forms for binary forms of even degree, in Invariant theory. Lecture 

Notes in Mathematics, 1278, 52-61, Springer, Berlin, 1987, MR0924165 (89h:15037). 
[27] J. P. S. Kung, Canonical forms of binary forms: variations on a theme of Sylvester, in Invariant 

theory and tableaux (Minnesota, MN, 1988), 46-58, IMA Vol. Math. Appl., 19, Springer, New 

York, 1990, MR1035488 (91b:11046). 
[28] J. P. S. Kung and G.-C. Rota, The invariant theory of binary forms, Bull. Amer. Math. Soc. 

(N. S.), 10 (1984), 27-85, MR0722856 (85g:050G2). 
[29] J. M. Landsberg and Z. Teitler, On the ranks and border ranks of symmetric tensors. Found. 

Comp. Math., 10 (2010), 339-366, 'SXw:0901. 0487, 
[30] G. Mehler, Bemerkungen zur Theorie der mechanischen Quadraturen, J. Reine Angew. Math., 

63, (1864), 152-157. 
[31] R. Miranda, Linear systems of plane curves, Notices Amer. Math. Soc, 46 (1999), 192-202, 

MR1673756 (99m: 14012). 
[32] L. J. Mordell, Binary cubic forms expressed as a sum of seven cubes of linear forms, J. London. 

Math. Soc., 42 (1967), 646-651, MR0249355 (40 #2600). 
[33] L. J. Mordell, Diophantine equations, Academic Press, London-New York 1969, MR0249355 

(40 #2600). 
[34] D. J. Newman and M. Slater, Waring's problem for the ring of polynomials, J. Number Theory, 

11 (1979), 477-487, MR0544895 (80m:10016). 
[35] I. Niven, Irrational numbers. Cams Mathematical Monographs, No. 11. Math. Assoc. Amer., 

New York, 1956, MR0080123 (18,195c). 
[36] G. Polya and I. J. Schoenberg, Remarks on de la Vallee Poussin means and convex conformal 

maps of the circle. Pacific J. Math., 8 (1958), 295-234, MR0100753 (20 #7181). 
[37] G. Polya and G. Szego, Problems and theorems in analysis, II, Springer- Verlag, New York- 
Heidelberg 1976, MR0465631 (57 #5529). 



[38; 

[39 

[4o; 

[41 

[42 

[43; 

[44; 

[45; 

[46' 
[47; 

[48; 

[49; 
[50' 

[51 



ON THE LENGTH OF BINARY FORMS 23 

V. Powers and B. Reznick, Notes towards a constructive proof of Hilbert's theorem on ternary 
quartics, Quadratic forms and their applications (Dublin, 1999), 209-227, Contenip. Math., 
272, Amer. Math. Soc, Providence, RI, 2000, MR1803369 (2001h:11049). 
K. Ranestad and F.-O. Schreycr, Varieties of sums of powers, J. Reine Angew. Math., 525 
(2000), 147-181, MR1780430 (2001ni:14009). 

B. Reichstcin, On expressing a cubic form as a sum of cubes of linear forms, Linear Algebra 
AppL, 86 (1987), 91-122, MR0870934 (88c:11022). 

B. Reichstein, On Waring's problem for cubic forms, Linear Algebra AppL, 160 (1992), 1-61, 
MR1137842 (93b:11048). 

B. Reznick, Sums of even powers of real linear forms, Mem. Amer. Math. Soc, 96 (1992), no. 
463, MR1096187 (93h:11043). 

B. Reznick, Uniform denominators in Hilbert's Seventeenth Problem, Math. Z., 220 (1995), 
75-97, MR1347159 (96e:11056). 

B. Reznick, Homogeneous polynomial solutions to constant coefficient PDE's, Adv. Math., 117 
(1996), 179-192, MR1371648 (97a:12006). 

B. Reznick, Laws of inertia in higher degree binary forms, Proc. Amer. Math. Soc, 138 (2010), 
815-826, arXiv - 0906.5559, MR26566547. 
B. Reznick, Blenders, in preparation. 

G. Salmon, Lesson introductory to the modern higher algebra, fifth edition, Chelsea, New York, 
1964. 

J.J. Sylvester, An Essay on Canonical Forms, Supplement to a Sketch of a Memoir on Elimina- 
tion, Transformation and Canonical Forms, originally published by George Bell, Fleet Street, 
London, 1851; Paper 34 in Mathematical Papers, Vol. 1, Chelsea, New York, 1973. Originally 
published by Cambridge University Press in 1904. 

J. J. Sylvester, On a remarkable discovery in the theory of canonical forms and of hyperdeter- 
minants, originally in Phiosophical Magazine, vol. 2, 1851; Paper 42 in Mathematical Papers, 
Vol. 1, Chelsea, New York, 1973. Originally published by Cambridge University Press in 1904. 
J. J. Sylvester, On an elementary proof and demonstration of Sir Isaac Newton's hitherto 
undemonstrated rule for the discovery of imaginary roots, Proc. Lond. Math. Soc. 1 (1865/1866), 
1-16; Paper 84 in Mathematical Papers, Vol.2, Chelsea, New York, 1973. Originally published 
by Cambridge University Press in 1908. 

J.-C. Yakoubsohn, On Newton's rule and Sylvester's theorems, J. Pure AppL Algebra, 65 (1990), 
293-309, MR1072286 (91j:12002). 



Department of Mathematics, University of Illinois at Urbana-Champaign, Urbana, 
IL 61801 

E-mail address: reznickOmath . uiuc . edu 



